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Abstract 

Using Talagrand's concentration inequality on the discrete cube {0, l} m we show 
that given a real-valued function Z{x) on {0, l} m that satisfies certain monotonicity 
conditions one can control the deviations of Z(x) above its median by a local Lipschitz 
norm of Z at the point x. As one application, we give a simple proof of a nearly optimal 
deviation inequality for the number of ^-cycles in a random graph. 

1 Introduction and main results. 

In this paper we suggest a new way to use Talagrand's concentration inequality on the cube 
to control the deviations of Boolean functions that satisfy certain monotonicity conditions. 
As one application we prove a suboptimal deviation inequality for the count of /c-cycles in a 
random graph. 

Let X = {0, 1} and define a probability measure u on X by A*({1}) = p, A*({0}) = 1 — p. 
Consider a product space X m with a product probability measure P = // m . Given a function 
Z : X m — > K. and a point x = (xi, . . . , x m ) G X m we define 

Vi(x) = Z(x) - Z(xx, Xi-i, 0, x i+ i, x m ), 

and 

m 

v(x) = j2k 2 (*)- 

Note that VJ(x) = if x t = 0. 

Let us state the main result of this paper. 

Theorem 1 If Z(x) and Vi(x),i < m are non- decreasing in each coordinate then for any 
a£t and t > 0, 

F(Z(x) > a + ^V{x)t)¥(Z{x) < a) < e~ t/2 . (1.1) 
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To understand the statement of Theorem 1, we notice that a function (V(x)) 1 ^ 2 can be 
interpreted as a kind of discrete Lipschitz norm of Z locally at the point x. For example, 
since Z(x) is defined only on the vertices of m-dimensional cube, if one extends Z(x) linearly 
from the point x to its neighbours only along the coordinates where Xi = 1, then (V(x)) 1 ^ 2 
is the norm of that linear map. Indeed, if we denote the map by L : K m — > R, then 

m 

L { z ) = y^;Vi( x )(Zi ~ x i) + Z ( x )> 
i=l 

and ||L|| = (V(x)) 1 / 2 . 

The proof of Theoreml is based on Talagrand's concentration inequality on X m . In order 
to give more clear interpretation of (jl.lj) . let us compare it with some typical ways of using 
Talagrand's inequality. One common application is the following. Given a convex function 
/ : [0, l] m — > M with a Lipschitz norm 

ll fl , \f(x)-f(y)\ 

\\f\\L= SUp : : < OO, 

x,y£[0,l] m \ X Hi 

where the supremum is taken over x ^ y, the following inequality holds: 

P(/>a+||/|| L Vt)p(/<a) <e- t/2 . (1.2) 

If the function / is defined only on the vertices of the cube X m , then it is possible to state 
a similar result where one has to use a discrete analog of the Lipschitz norm. For example, 
the following deviation inequality holds (see PQ). For % < m we define x l G X m such that 
x l , = Xj for j ^ i and x\ ^ x iy and define 



= sup (£(/(*) -jV)) 2 ) V2 . 
i=i 

Then 

F(f>Mf+\\f\\ d Vt) <e-*/ 4 . (1.3) 

Both inequalities (jl.2j) and (jl.3|) use global Lipschitz condition to control the deviation of 
f(x). Theorem 1 suggests a possibility of using a local Lipschitz norm V(x) 1 ^ 2 at the point 
x, provided that the monotonicity conditions are satisfied. The reason why we compute the 
Lipschitz norm only in the direction of decreasing Z is because we control the deviation of 
Z above level a. Theorem 1 is similar in spirit to the ideas in jE], [E]>[H] ( see a l so references 
therein) , where the authors describe a way of using average Lipschitz norm of Z to control 
its deviations. 

One example when the monotonicity conditions are satisfied is the following. Let us 
consider a set of indices M. = {1,2, ... ,m} and a set of nonnegative numbers ac > 
indexed by the subsets C C M.. Consider the function Z{x) defined by 

Z(x) = J2 ac]J x i (1.4) 

CCM i&C 
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In this case the fact that ac's are non-negative implies that functions Z{x) and Vi(x),i < m 
are non- decreasing in each coordinate. Below we will consider the example of counting the 
number of fc-cycles in a random graph which can be represented in the form (|1.4J) and, thus, 
Theorem 1 is applicable. 

Consider a standard Erdos-Renyi model of a random graph G(n,p). Let V be a set of n 
vertices, m = Q) an d let E = {e 1; . . . , e m } denote a set of edges of a complete graph K n on 
n vertices. Given ) G X m } the fact that = 1 or describes that the edge 

&i is present or not present in the graph G(n,p) respectively. Let 

C k = |{ei i; . . . , e ik ) : G £, and {e ii; . . . , e ifc } form a fc-cycle j 
be a collection of all fc-cycles, and for e G ^ let 

C fe (e) = {c G Cfc : e 6 c} 
be a set of all fc-cycles containing the edge e. We consider the following function on X m 

z (x) = X) 11^' 

ceCfc eGc 

which is the number of fc-cycles in a random graph G(n,p). In this case V(x) can be clearly 
written as 

n*)=5> e ( j2 n^') 2 - (L5) 

egS ceC fc (e)e'^e 

In this case, in order to use Theorem 1 to control the deviation of Z(x) above its median 
M(Z) (or its expectation EZ) we will proceed by showing how to control V(x) in terms of 
Z(x). 

We assume that for some large enough C{k) > 0, 

np > C{k) logn. (1.6) 

The following theorem holds. 

Theorem 2 If 4i.6)) holds then there exists a constant C(k) > that depends on k only 
such that 

V(x) > C{k)({np) k - 2 Z{x) + (np) 2 ^)) < exp(- 

/V Cyk) log log rap 

Theorems 1 and 2 will readily imply the following theorem. 

Theorem 3 If 111.6)) holds then, 

(1) For any e > there exists a constant C(k, e) that depends on k and e only such that 
the following holds 

If EZ > C(k,e) then M(Z) < (l + e)EZ. 

(2) There exists a constant C(k) > such that ifEZ > C(k) then 

(np) 2 



Z > 2EZ) < exp(- ™ ). (1.7) 

V C(k) log log np) 
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Recently the authors of [3] proved a more general result describing the deviations of the 
count of any subgraph in a random graph. In the case of fc-cycles their bound gives 

F(Z > 2EZ) < exp(-(np) 2 /C(k)). 

This shows that the factor log log np in (jl.7j) is unnecessary, but at the moment we don't see 
how to get rid of it using our approach. This has nothing to do with Theorem 1, since the 
factor log log np comes directly from Theorem 2 which, probably, can be improved. In the 
case of triangles (k = 3) the bound F(Z > 2EZ) < exp(— {np) 2 /C{k)) was also proved in j^j. 

2 Proof of Theorem 1. 

Talagrand's concentration inequality on the discrete cube is the main tool in the proof of 
Theorem 1. Let us recall it first. 

Given a point x G X m = {0, l}' m and a set A C X m , let us denote 

U A (x) = {(si)i< m G {0, l} m , ByeA 1 s i = 0=^y i = a*}. 

The "convex hull" distance between the point x and the set A is defined as 

f c {A, x) = inf{|s| : s G convU A (x)}, 

where \s\ is the Euclidean norm of s. The concentration inequality of Talagrand (Theorem 
4.3.1 in states the following. 

Proposition 1 For any t > 0, 

F(A)f(x G X m : f 2 (A,x) >t\< e- t/2 . (2.1) 

The main feature of this distance is that (Theorem 4.1.2 in jjj) 

m m ^ 

v(A,),< m lye a j2 x ^y^ x ^^f^ x \J2 x i) ■ ( 2 - 2 ) 

8=1 8=1 

Proof of Theorem 1. For a fixed number a G M consider a set 

A = {y G X m : Z(y) < a}. 

For a fixed x G X m and an arbitrary y G A, since Z(y) < a, we can write Z[x) — a < 
Z{x) — Z{y). Consider three sets of indices 

h = {i : Xi = 1, yi = 0}, J 2 = {i : Xi = 0, y { = 1}, J 3 = {i : x» = 

Without loss of generality we will assume that fx = {1, . . . , k} and J 2 = {fc + 1,. ..,/}. Define 
a sequence 

= (yi, . . . , yi, . . . , x m ), z = 0, . . . , m. 



4 



We have 

m m 

Z(x) - Z(y) = ^(Z^- 1 ) - Z(z«)) = J2( Z ^ ~ Z&WtPi * W). 

i=l i=l 

since a;, = yj (i.e. i > I) implies that Z(z l ~ 1 ) — Z(z l ) = 0. We have 

Z(z 1 - 1 ) - Z(z l ) = < Vi(x) for i = l + l,...,m, 

since for this range of indices z l ~ l = z\ 

Z^- 1 ) - Z(z l ) < < Vi{x) for i = k + l,...,l, 

since the function Z is non-decreasing in each coordinate and for i £ I2, z % ~ 1 — 0, z\ = 1 
and all other coordinates of z l ~ l and z l coincide, and 

Z{z i ~ x ) - Z{z l ) = Viiz*- 1 ) < Vi(x) for i = l,...,k, 

since for i 6 /j each coordinate of z* -1 is smaller than the corresponding coordinate of x. 
Thus we proved that for any y G A 

m 

Z(x) -a<^2 Vi{x)I{xi ^ yi). 

i=l 

By (|2.2jl there exists y & A such that the last expression can be bounded 

Vi(x)I( Xi + Vi ) < f c (A, x) (J2 = UA, x) 

1=1 1=1 

Talagrand's inequality (j2.1|) states that 

and, therefore, we finally get 

F(Z(x) >a + y/V(x)t)F(Z(x) < a) < e~ t/2 . 

□ 

3 Proof of Theorem 2. 

We will denote by d v the degree of a vertex v. Consider the sequence of sets 

V l = {v:d v < 16np}, Vj = {v : d v E [2 j+2 np, 2 j+3 np)}, j>2. (3.1) 
We will start by stating several basic facts that will be used in the proof of Theorem 2. 
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Lemma 1 // M.b\) holds then, 

f(3v : d v > (np) 2 ) < e - (np)2/2 . (3.2) 

Proof. For a fixed vertex v its degree d v is a sum of (n — 1) independent variables with the 
distribution /i. Using Bernstein's inequality one can easily check that for np > 4 

p(d v > (np) 2 } < e - {np) \ 

The union bound will produce a factor n and, therefore, using (jl.6|) implies ()3.2|) . 

□ 

Thus, with high probability we can assume that the degree of each vertex is bounded 
by (np) 2 and, therefore, we can only consider the sets Vj in (j3.1J) such that 2 J+2 < np and, 
therefore, j < lognp. 

Next we will bound the cardinality of each Vj. 

Lemma 2 ForC(k) > large enough we have, 

P(32 < j < lognp caxdVj > — ) < exp(- — ). (3.3) 

V j I 3 log lognp/ V G(/c) log lognp/ 

Proof. We copy the proof from jB] (see equation (12) in section 4.2 there). For a fixed j > 2, 
assume that 

cardK- > r = — - — . 

j2-? log lognp 

In this case, there exists a set of r vertices each with degree at least 2 J+2 np. It implies that 
the number of edges containing exactly one of these vertices exceeds 

(2 J+2 np — r)r > 2 J+1 npr, 

The probability that such a set of edges exists is bounded by 

(n\(r(n — r)\ nj+i^ ( , en „■,-, , er(n — r) 

^ P P < exp r log — + 2 J+1 npr log — ^ - + 2 J+1 npr logp 

\r J \2 J+i npr J \ r 2 1+l npr 

< exp r log — + 2 J+1 npr log — — ) < exp (-7777^ — i 

V r 2-? +i / V C(k) log lognp 



where in the last inequality we used the estimate 

03+1 1 e / (^P) 2 1 1 / (^P) 2 

2 J+i 2] log lognp 2-? 1 G (A;) log lognp 

and the first term rlog(en/r) was negligible compared to the second term. Taking the union 
bound over j < lognp, we get a factor lognp in front of the exponent that can be ignored 
by increasing C(k). 

□ 
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Before we will state our next lemma, we need to make one remark about the proof of 
Theorem 2. Multiplying out the right-hand side of ()1.5|) we observe that V(x) can be written 
as a sum of terms 

x e Y\ x e i Y\ x e', where e E E , c, d E Ck(e). 

e'dc e'Gc' 

Each of these terms may appear several times, but, clearly, the number of appearances will 
be bounded by C(k) that depends on k only. Each of these term represents two cycles that 
have at least one edge in common. There are many different isometric configurations of such 
two cycles but, clearly, the number of them is bounded by a constant that depends on k 
only. Hence, V(x) can be decomposed into the sum of the counts of such pairs of cycles over 
different configuration. With minor modifications it is possible to prove the statement of the 
theorem for each of these configuration. 

We will only look at the pairs of cycles that have exactly one edge in common. Let us 
denote the number of such pair by W(x). We will identify each pair of cycles with an injection 
a : {1, . . . , 2k - 2} -> V(G), such that (a(l), a (2) . . . , a(k)) and (a(k), a(k + l),..., a(l)) 
are the ordered vertices of these two cycles, and a(l)a(k) is their only common edge. Let us 
denote the set of these injections by Eq. 

Lemma 3 There is a partition of vertices V(G) — Fi U . . . U F2k-2 such that 

W(x) < C(£;)card{(x E E : Vi a(i) E F { }. (3.4) 

Proof. See Proposition 1.3 in 

□ 

Let us denote the set in the statement of Lemma 3 by 

E = {a E S : \/i a{i) E F}. (3.5) 

Proof of Theorem 2. By Lemma 3 and the discussion preceeding Lemma 3, all we need 
to do is to estimate the cardinality of E in ()3.5j) . Let us consider the event 

S = {VJ > 2 cardV^ < [J{V» : i. < (npf}. 

By Lemma 1 and Lemma 2 this event holds with probability at least 

(np) 2 



exp 



C(k) log log np 



for some C(k) large enough. From now on we assume that this event occurs. For each vertex 
v E Fx let us denote 

Si{v) = {HI), . . . , ail)) : a E E, a(l) =v}, l> 1. 
Let us denote d„ = m&x(d v , np). We will prove that if £ occurs than for / > 2 

caxdSi{v) < C{k)d+{np) 1 - 2 . (3.6) 
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F{k} 



F{2k-2^\ /gjFjk+ll F{2k-2)(^)__ Q F{k+1} 

Figure 1: (1) Counting a e Ei. (2) Counting cr e E 2 . 



For / = 2, this obviously holds with C(k) = 1. We proceed by induction over /. Let us 
decompose 

where 

S>(v) = {((7(1), . . . , a(Z)) G 5,( W ) : a(Z - 1) G V-}. 

To bound the cardinality of cardS* (i>) we use the induction hypothesis and the fact that 
the degree of each vertex in the set V\ is bounded by 16np. We get 

caxdS", 1 (u) < (l&np) caxdSj_i(u) < C(k)d+(np) 1 - 2 . 

To bound the cardinality of Sf (v) for j > 2 we notice that for each (cr(l), . . . , cr(l)) G S 1 / (i>), 
we have (cr(l), . . . , cr(Z — 2)) G Si_ 2 (u) and er(Z — 1) G V^. On the event £ we can control the 
cardinality of Vj and, moreover, the degree c^z-i) < 2 j+3 np. For Z = 3, since cardS'i(f) = 1, 
we get 

caxdSg(u) < (cardl^) (2 i+3 np) < ^ 2 j+3 np < S(np) 2 - ' 



2 J j log log np j log log np 

and since on the event S we can assume that 2 J+2 < np, which implies that j < \ognp, we 
get 

log np log np 

caxdSg'(u) < 8(np) 2 jloglognp ^ C{k)(np) 2 < C(k)d+(np), 

and this proves the induction step for I = 3. The last inequality explains the appearance of 
the factor log log np in Theorem 3. Similarly, for I > 4 we get 

cardS/ (v) < cardSj_ 2 (v) (cardV}) (2 j+3 np) 
< C(k)d+(np) 1 - 4 -—^- 2^' +3 np < C(k)d+(np) 1 - 2 ' 



2>j log log np v j log log np 
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and 

log np log np 

Y caxdS{ (v) < C(k)d+(np) 1 - 2 V — < C{k)d+{np) 1 - 2 . 

This completes the proof of the induction step and (|3.6)l . 

To estimate the cardinality of E we will decompose it into E = Ei U E 2 , where 

Ei = {a G E : (7(1) G 7 X }, E 2 = {cr G E : <r(l) G V$, 2 < j}. 

We will estimate the cardinality of Ei and E 2 differently (the idea is illustrated in Figure 
1). First of all, since we can control the cardinality of Vj for j > 2, we will simply use (j3.6|) 
for I = 2k — 2 to compute the number of different paths from v G Fx to F 2k _ 2 , and then add 
them up. This will give us the bound on cardinality of E 2 . On the other hand, for o G E x we 
can represent it as a cycle on F\, F k , . . . , F 2k - 2 and a path from Fi to F k -\. In this case, the 
number of cycles is bounded by Z(x), and to bound the number of paths we again use ()3.6|) . 
Let first estimate the cardinality of E 2 . First of all by ()3.6|) for each vertex v G Vj D F\ 

c&TdS 2k -2(v) < C(k)2^(np)(np) 2k -\ 

and, therefore, on the event £ , 

card{a G E : cr(l) G V$} < C(A;)2 J+3 (np) 2/c - 3 (cardV i ) < C(kp +3 (np) 2k - 3 "'' 



j2i log log np 

When we add up these injections over j > 2 we get 

log np 

y C(kp +3 (np) 2k - 3 —^- < C(k)(np) 2k - 2 . 

j— ^ 

This accounts for the second term in the bound of the theorem. 

Now consider all injections o such that o~(l) G V\. Consider the trace of the set of images 
of the injections from E (in other words, pairs of cycles) on the set 

(Vx n Fx) U F k U F k+l U . . . U F 2k _ 2 , 

i.e. 

P = {(a(l), a{k), a(k + l),..., a(2k - 2)) : a G E, a(l) G VJ. 

First of all, the cardinality of "P is bounded by Z(x), since V can be identified with the 
subset of all cycles in the random graph. Moreover, for each (vi, v k , Ujfc+i, • • • , v 2k - 2 ) G the 
number of injections a G E such that cr(l) = v i, cr(fc) = f fc , . . . , cr(2A; — 2) = t> 2fc _ 2 is bounded 
by cardS'fc_i(f i), since all values of the injection are fixed except for cr(2), . . . , a{k — 1). But 
since V\ G V\ implies that the degree d vi < IQnp and, thus, d+ < 16np, we have by (|3.6j) 

cardS fc _i(vi) < C(k){np){np) k - 3 < C{k){np) k - 2 . 

Therefore, the cardinality of all injections such that a(l) G V\ is bounded by 

C(k)(cwdV){np) k - 2 < C(k)Z(x)(np) k - 2 , 

which accounts for the first term in the statement of the theorem. 
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4 Proof of Theorem 3. 

Theorem 2 implies that for any a£l and t > 0, 

1 /2 

Z < a + Wtj < W>(z <a + (C{k)((np) k - 2 Z + (np) 2(fc-1) )t N 



+P(F > C(fc)((np) fc Z+ (np) 2fc ) 

V2\ / (np) : 



C(/c) log lognp 



< p(Z < a+ (C(k)((np) k - 2 Z + (np) 2 ^)^ \ + exp( 
which implies that 

p(z > a+ (C{k)({npf- 2 Z + (np) 2ik -^)tj ) < pjz^a + v 7 ! 7 * 

/ (np) : 
+ expl 



C(k) log lognp 

Multiplying both sides by P(Z < a) and using Theorem 1 with t = 2e(np) 2 we get 

1/2 



Z > a + (c(A;)£((np) fe Z + (np) 2k )^j Wz < a 

(np) 2 
C(k) log log np 



< P(Z < a)exp(- (WP)2 ) +e-^) 2 . (4.1) 

V C ( /c log log no/ 



If we take 

a = M- ^C{k)e{{np) k M + {np) 2k ) (4.2) 

then, clearly, the following two events are equal 

{Z > a + v/C(fc)£((np) fc Z + (np) 2fc )} = {Z > M}, 
and, therefore, (j4.1j) implies that for np > C{k) for large enough C(k) > 0, 

pfz < M - ^C(k)s((np) k M + (np) 2fc )) < 3e" e(np)2 . 

Since EZ > aP(Z > a), the choice of a as in (J4.2j) gives 

M - ^JC(k)e((np) k M + (np) 2k ) < EZ f 1 - 3e- £(np)2N 



This, clearly, implies the first statement of Theorem 3. 

To prove the second statement we use (|4.1|) with a = M, and assume that np is large 
enough, so that M < (1 + e)KZ. Then with probability at least 



1 _ 2e- £(np)2 -exp(- 



(np)' 



C(k) log log np 
we have 



Z < (1 + e)EZ + ^C{k)e{{np) k Z +{np) 2k ). 
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Since EZ ~ (np) k , for small enough e this implies that Z < 2~EZ, which completes the proof 
of the second statement of Theorem 3. 

□ 
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